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^ ■ Abstract 

We study random subgraphs of the n-cube {0,1}", where nearest-neighbor edges are 
occupied with probabihty p. Let Pc{n) be the value of p for which the expected cluster 
size of a fixed vertex attains the value A2"/^, where A is a small positive constant. Let 
e = n{p — pc{n)). In two previous papers, we showed that the largest cluster inside a scaling 
window given by |e| = 0(2^"/^) is of size 0(2^"/^), below this scaling window it is at most 
. 2(log2)ne~^, and above this scaling window it is at most 0(e2"). In this paper, we prove 

. that for p — Pc{n) > e"'^"^''^ the size of the largest cluster is at least 0(e2"'), which is of the 

I same order as the upper bound. This provides an understanding of the phase transition that 

■ goes far beyond that obtained by previous authors. The proof is based on a method that has 

. come to be known as "sprinkling," and relies heavily on the specific geometry of the n-cube. 



1 Introduction and results 



1.1 History 



c3 ■ The study of the random graph G{N,p), defined as subgraphs of the complete graph on vertices 
in which each of the possible (^^^ edges is occupied with probability p, was initiated by Erdos and 
Renyi in 1960 JHl- They showed that for p = A^~^(l + e) there is a phase transition at e = in 
the sense that the size of the largest component is 0(logA^) for e < 0, G(A^) for e > 0, and has 
the nontrivial behavior B(A^^/^) for e = 0. 

The results of Erdos and Renyi were substantially strengthened by Bollobas [S] and Luczak 
|19j . In particular, they showed that the model has a scaling window of width A^~^/^, in the sense 
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that ifp = A^"^(l + AAriV~i/3) ^j^g^ ^j^g 

size of the largest component is 0(iV^/^) when A at remains 
uniformly bounded in N, is less than B(A^^/^) when A^r — > — oo, and is greater than 0(iV^/^) 
when A^v +00. It is also known that inside the scaling window the expected size of the cluster 
containing a given vertex is Q{N^^^). 

The scaling window is further characterized by the emergence of the giant component. When 
p = N~^{1 + AnN''^^^) and Ajv — > +00, then for any constant K > 1 the largest component will be 
almost surely more than K times the size of the second largest component. When A^r — > —00 this 
almost surely does not happen. However, inside the window, with A^v = A fixed, this occurs with 
a limiting probability strictly between zero and one. This is particularly striking with computer 
simulation. For example, for N = 50,000, when Atv = —4 the largest components are all roughly 
the same size but by the "time" Atv = +4 most of them have joined to form a dominant component 
several times larger than its nearest competitor. 

In this paper, we consider random subgraphs of the n-cube Q„, = {0, 1}", where each of 
the nearest-neighbor edges is occupied with probability p. We emphasize the role of the volume 
(number of vertices) of Q„ by writing 

V^=|Qn|=2^ (1.1) 

This model was first analysed in 1979 by Erdos and Spencer p3], who showed that the probability 
that the random subgraph is connected tends to for p < 1/2, for p = 1/2, and 1 for p > 1/2. 
More interestingly for our purposes, they showed that for p = n~^(l + e) the size of the largest 
component is o(y) for e < 0, and conjectured that it is Q(y) for e > 0. 

The conjecture of Erdos and Spencer was proved in 1982 by Ajtai, Komlos and Szemeredi j3], 
who thereby established a phase transition at e = 0. Their results apply for p = n~^{l + e) with e 
fixed. For e fixed and negative, the largest component can be shown by comparison with the Poisson 
branching process to be a.a.s. of size 0{n). (We say that En occurs a.a.s. if lim^^oo IP(-E'n) = 1-) 
On the other hand, for e fixed and positive the largest component is at least of size cV for some 
positive c = c(e). To prove the latter, Ajtai, Komlos and Szemeredi introduced a method, now 
known as "sprinkling," which is very similar to methods introduced at roughly the same time in 
the context of percolation on Z"' by Aizenman, Chayes, Chayes, Frohlich and Russo |2j. We will 
use a variant of sprinkling in this paper. Very recently, Alon, Benjamini and Stacey ^ used the 
sprinkling technique to extend the Ajtai, Komlos and Szemeredi result 0| to subgraphs of transitive 
finite graphs of high girth. These results, while applicable to much more than the n-cube, also 
hold only for e fixed. 

A decade after the Ajtai, Komlos and Szemeredi work, Bollobas, Kohayakawa and Luczak [TH] 
substantially refined their result, in particular studying the behavior of the largest cluster as e — *• 0. 
Let |C(x)| denote the size of the cluster of x, let Cmax denote a cluster of maximal size, and let 

|C„,ax| =max{|C(x)| :xGQ„} (1.2) 

denote the maximal cluster size. We again take p = n~^{l + e), and now assume that e ^ as 
n —>■ 00. In [TUl Corollary 16, Theorem 28] (with somewhat different notation), it is proved that 
for e < — (log?T,)^(loglogr;,)~^n~^/^, 

l^maxl = ^^"f ^ (1 + 0(1)) a.a.s. as n ^ 00, (1.3) 
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and that for e > 60(logn)^n~^, 

|Cmax| = '2eV(l + 0(1)^ a.a.s. as n — > cxo. (1.4) 

Thus, e > 60 (log n)^?^^^ is supercriticaL In addition, it is shown in [TUl Theorem 9] that the right 
side of ()1.3|) is an upper bound on |Cmax| provided that p < {n — — e~°^''^\ and hence such p 
are subcriticaL 

In recent work fTH, IT^ , we developed a general theory of percolation on connected transitive 
finite graphs that applies to Q„ and to various high- dimensional tori. The theory is based on the 
view that the phase transition on many high- dimensional graphs should have similar features to 
the phase transition on the complete graph. In particular, the largest cluster should have size 
@(^2/3-j jj-^ scaling window of width Q(V~^^^), and should have size o(V^/^) below the window 
and size G(V^) above the window. Note that the bounds of ^U], while much sharper than those 
established in j^, are still far from establishing this behavior. 

We will review the results of [TTl [T2] in detail below, as they apply to Q„. These results do 
not give a lower bound on the largest cluster above the scaling window, and our primary purpose 
in this paper is to provide such a bound. We define a critical threshold Pc{n) and prove that the 

1 /3 

largest cluster has size Q{[p ~ Pc{n)]nV) for p — Pc{n) > e"'^" . This falls short of proving a 
bound for all p above a window of width V~^^^, but it greatly extends the range of p covered by 
the BoUobas, Kohayakawa and Luczak bound p.4j) . 



1.2 The critical threshold 

The starting point in jjllj is to define the critical threshold in terms of the susceptibility x{p)j 
which is defined to be the expected size of the cluster of a given vertex: 

x(p)=Ep|C(0)|. (1.5) 

For percolation on Z'^, x{p) diverges to infinity as p approaches the critical point from below. 
On Q„, the function x is strictly monotone increasing on the interval [0,1], with x(0) = 1 and 
x(l) = V. In particular, x{p) is finite for all p. 

For G{N,p), the susceptibility is G(A^^/'^) in the scaling window. For the role of is played 
by y = 2", so we could expect by analogy that Pc{n) for the n-cube should be roughly equal to the 
p that solves xip) = V^^^ = 2"/^. In JI], we defined the critical threshold Pc = Pc{n) = pdn; A) 
by 

xiPc) = XV'/\ (1.6) 

where A is a small positive constant. The flexibility in the choice of A in ()1.6|1 is connected with the 
fact that the phase transition in a finite system is smeared over an interval rather than occurring at 
a sharply defined threshold, and any value in the transition interval could be chosen as a threshold. 
Our results show that Pc defined by ()1.6p really is a critical threshold for percolation on Q„. 

The triangle condition plays an important role in the analysis of percolation on Z'^ for large d 
13 13 Cni, as well as on infinite non-amenable graphs [22] • For x,y & Qn, let {x ^ y} denote the 
event that x and y are in the same cluster, and let Tp{x,y) = Pp(x ^ y). The triangle diagram is 
defined by 

Vp(a;,?/)= J2 Tp{x^w)Tp{w,z)Tp{z,y). (1.7) 
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The triangle condition is the statement that 

max Vp^(n){x,y) < 6cc,y + ao (1-8) 

where is less than a sufficiently small constant. The stronger triangle condition is the statement 
that there are positive constants Ki and K2 such that Vp(x, y) < 5x,y + cto uniformly in p < Pc{n), 
with 

ao = ao(p) = K,n-' + K^x^pW^^- (1.9) 

If we choose A sufficiently small and n sufficiently large, then the stronger triangle condition implies 
the triangle condition. 

In ^2]; we proved the stronger triangle condition for Q„. In |1H Theorems 1.1-1.5], we derived 
several consequences of the stronger triangle condition in the context of general finite connected 
transitive graphs. For the n-cube, these results (see Theorems 1.1, 1.5] imply that there is a 
Ao > and a 60 > (depending on Aq) such that if < A < Aq then 

1 - A"V-^/3 < np^{n) < 1 + bon'\ (1.10) 

In addition, given < Ai < A, let pi be defined by xiPi) = ^iV^^^. Then 

< «(p.(n) -pO < ^^11 +''o"-'|. (1.11) 

Thus decreasing A to Ai shifts pdn) only by 0{n^^V^^^^), so pi remains in the scaling window. 
The asymptotic formula Pc{n) = n^^ + 0{n^'^) of p.lO|) is improved in J7] to 

Pe(n) = i + 1 + -I- + 0{n-'). (1.12) 



n n 
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Presumably there is an asymptotic expansion to all orders, so that there are real numbers ctj (i > 1) 
such that for each s > 1 

p,{n) = ^ a,n'' + 0{n'^'+^^). (1.13) 

i=l 

Except in the unlikely event that = for all i sufficiently large, so that the expansion is actually 
a finite polynomial in n~^, we see from fll.l3|) that for every s > 1, the truncated expansion 
Ylii=i cbin~^ lies outside an interval of width V~^^^ = 2~"/^ centered at pdn), for large n. The non- 
perturbative definition (|1.6p of Pc{n) therefore tracks the scaling window more accurately than 
any polynomial in can ever do. Moreover, as is discussed in more detail in ^Tj, we expect 
that the full expansion J2^i (iin~^ is a divergent series. This would mean that, given n, if we take 
s large depending on n then the truncated series Yli=i (iin~^ would be meaningless — possibly not 
even lying in the interval [0, 1]. 

Bollobas, Kohayakawa and Luczak raised the question of whether the critical value might be 
equal to {n — 1)^^. Note that n — 1 is the forward branching ratio of the "tree approximation" 
to the n-cube, so that this suggestion would mean that the tree approximation would give the 
correct critical value. Note also that, because of the smearing of the random graph critical point 
by the scaling window of width A^~^/^, on the random graph there is no distinction between the 
critical values Pc{N) = {N — and Pc{N) = N~^, so that the the tree approximation does give 
a correct critical value in that case. However, for the n-cube, our picture that the width of the 
scaling window is Q{V^^^^) = 0(2""/'^) implies that there is a real distinction between the values 
n~^ and (n — 1)~^, and p.l2|) implies that both lie below the critical window. 
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1.3 In and around the scaling window 

Given p G [0, 1], let e = e(p) G M be defined by 

p = p^{n) + -. (1.14) 
n 

We say that p is below the window (subcritical) if eV^^^ —oo, above the window (supercritical) 
if eV^^^ — > oo, and inside the window if |e|y^/'^ is uniformly bounded in n. In this section, we 
summarize and rephrase the results stated in ^T] Theorems 1.2-1.5], as they apply to Q„. We have 
stated slightly weaker results than those obtained in ^1], in order to simplify the statements. These 
results where proved in ^1] assuming the triangle condition (or the stronger triangle condition for 
im Theorem 1.5], and the stronger triangle condition was established in J12j. 

Theorem 1.1 (Below the window). Let A < Aq and p = Pc{n) — en^^ with e > 0. Then 

X-W-y^ + e - - A-^V-V3 + [l-ao]e ' ^'-^'^ 
with Qq = o-oip) given by (|1.9|) . Moreover, if eV^^^ — ^ oo as n ^ oo, then 

Xip) = -Jl + 0(1)], (1.16) 
^[1 + 0(1)] < E,(|C_|) < ^^[1 + 0(1)], (1.17) 



lO^e' 



^ .[1 + 0(1)] < IC^axI < ^^^[l + o(l)] a.a.s. (1.1^ 



3600e2 

In the language of critical exponents, the above bounds on x{p) correspond to 7 = 1. Since 
Pc{n) > {n — 1)^^ for large n by ()1.12j) . the upper bound of ()1.17|1 extends the range of p covered 
by the Bollobas, Kohayakawa and Luczak upper bound of from p < (n — — e~°^") to all p 
below the window. We conjecture that the upper bound of p.l7|l is actually sharp for all p that are 
not exponentially close topc(^). This was proved in [lOj (see p.3|) ) for e > (logn)2(loglogn)~^?7,~^. 
We also conjecture that this behavior can be extended appropriately to cover a larger range of e, 
as follows. 

Conjecture 1.2. Let p = Pc{n) — en^^ with lim„^oo e = and lim„^oo ee^" = 00 for every 6 > 0. 
Then 

|Cn,ax| = ^^^[1 + 0(1)] a.a.s. (1.19) 
// we assume instead that lim^^oo e = and lim„^oo e2"/^ = 00, then 

e 

Note that p.2()|l reduces to ()1.19j) when e is not exponentially small. The asymptotic behavior 
p.2()j) . with V replaced by N, is known to apply to the random graph for N~^^^ <^ N{pc —p)<^l; 
see fSl Theorem 5.6]. 

For k>0, let 

P>k = Ki\Cm>k). (1.21) 



|C_| = e(^^^^) a.a.s. (1.20) 
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Theorem 1.3 (Inside the window). Let A < Aq and A < cxd. Let p = pc + Q with \e\ < 
AV^^^^. There are finite positive constants 61, . . . , feg such that the following statements hold. 
i)Ifk< hiV^I^, then 

ii) 

and, if uj>l, then 
Hi) 



b,V^/^ < Ep(lC^axl) < hV^/^ (1.23) 
fJco-'V'/' < |C^,.| < cuV'/') > 1 - ^. (1.24) 



h-jV^'^ < x{p) < hV^^^. (1.25) 

In the above statements, the constants 62 one? 63 can be chosen to be independent of A and A, the 
constants 65 and bg depend on A and not X, and the constants bi, b^, b^ and bj depend on both A 
and A. 

In terms of critical exponents, ()1.22p says that 5 = 2. 
Theorem 1.4 (Above the window). Let A < Aq and p = Pc + e^"^ with eV^^'^ 00. Then 

X{p) < 162eV, (1.26) 



Ep(IC^axl) < 28eV, (1.27) 

and, for all u > 0, 

^ /.^ , \ const 

^p(\Cm..\>u:(^Vj<—^. (1.28) 
A refinement of p.28p will be given in Section ITTl 

To see that there is a phase transition at Pc{n), we need an upper bound on the maximal cluster 

size in the subcritical phase and a lower bound in the supercritical phase. The former is given in 
Theorem 11.11 but the latter is not part of Theorem 11.41 

1.4 Main result 

Our main result is the following theorem, which is proved in Section |21 Theorem 11.51 provides the 

1/3 

missing lower bound for e > e~^^ . This restriction on e is an artifact of our proof and we believe 
the theorem remains valid as long as eV^^^ — > oo; see Coniecture 11.61 To fully establish the picture 
that there is a scaling window of width Q(y~^^^), it would be necessary to extend Theorem II. 51 to 
cover this larger range of e. 

Theorem 1.5. There are c, ci > and Aq > such that the following hold for all < X < Xq and 
all p = Pc + en~^ with < e < I-' 

|Cmax| > Cie2" a.a.s. as n 00, (1.29) 
X{p) > [1 + o(l)](cie)^2" as n-^ 00. (1.30) 
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It is interesting to examine the approach to the critical point with \p — Pc{n)\ of order for 
different values of s. Our results give a hierarchy of bounds as s is varied. For example, it follows 
from Theorem 11.11 that for p = Pc{n) — 6n^^ with s > and 6 > 0, 

x{p) = n^-'6-'[l + 0{n-% (1.31) 

On the other hand, for p = Pc{n) + 5n~^ with s > and 5 > 0, it follows from Theorems 11.41 and 
IT31that 

X{p) = e(5V(^-^)2"). (1.32) 
Related bounds follow for |Cmax|- Thus there is a phase transition on scale n~* for any s > 1. 



1.5 More conjectures 

For e > 60(log?2)^n~^, Bollobas, Kohayakawa and Luczak (TU] proved (see ()1.4p ) that |Cmax| = 
2e2"(l + o(l)) a.a.s. We conjecture that this formula holds for all e above the window. This 
behavior has been proven for the random graph above the scaling window; see fHl Theorem 5.12]. 

Conjecture 1.6. Let p = Pc{n) + en~^ with e > 0, lim„_>oo e = and lim„_>oo eV^^^ = oo. Then 

|C„,ax| =2eV [1 + 0(1)] a.a.s., (1.33) 
x(p)=4eV[l + o(l)]. (1.34) 

The constants in the above conjecture can be motivated by analogy to the Poisson branching 
process with mean A. There the critical point is A = 1 whereas we have a critical point Pc{n). An 
increase in p beyond Pc{n) by en~^ increases the average number of neighbors of a vertex by e, 
which we believe corresponds to the Poisson branching process with mean 1 + e. This process is 
infinite with probability ~ 2e. When we generate the component of x in Q„ it cannot, of course, 
be infinite, but with probability ~ 2e it will not die quickly. Consider components of x,y e Qn 
that do not die quickly. We believe that these components will not avoid each other. Rather, 
all of them will coalesce to form a component Cmax of size 2eV. Finally, with probability ~ 2e 
a given vertex hes in Cmax and this contributes ~ {2e){2eV) to x{p): which we believe is the 
dominant contribution. Note also that for any positive constant e, it is shown in 10, Theorem 29] 
that I Cmax I ~ o,N where a is the probability that the Poisson branching process with mean 1 + e 
is infinite. 

For the random graph G{N, p), outside the scaling window there is an intriguing duality between 
the subcritical and supercritical phases (see [HI Section 10.5], and, for a more general setting, see 
|21j). Let p = A^~^(l+e) lie above the scaling window for G{N,p), and to avoid unimportant issues, 
assume that e = o(l). Almost surely, there is a dominant component of size ~ 2eA^. Remove this 
component from the graph, giving G~ . Then G~ behaves like the random graph in the subcritical 
phase with probability p' = pc{l — e) . In particular, the size of the largest component of G~ (the 
second largest component of G) is given asymptotically by the size of the largest component for 
p'. 

We believe that this duality holds for random subgraphs of Q„ as well. Let C2 denote the 
second largest cluster, and set p = n^^{l + e). Bollobas, Kohayakawa and Luczak JU] showed that 
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if e — > and e > 60{logn)^n ^ then IC2I ~ (21ogy)e ^ a.a.s. Note that this matches the behavior 

of |Cmax| for P = Pc{n) — en~^ given in Conjecture 11.21 The foUowing conjecture, which we are far 

1/3 

from able to show using our present methods even for e > e"*^" , claims an extension of the result 
of to all p above the window. 

Conjecture 1.7. Let p = Pc{n) + tn^^ with e > 0, lim^^oo e = and lim^^oo e^^^^ = 00. Then 
as n ^ 00, the size of the second largest cluster C2 is 

IC2I = ^^^[l + o(l)] a.a.s. (1.35) 

Finally, we consider the largest cluster inside the scaling window. For G{N,p), it is known that 
the largest cluster inside the window has size XN'^^^ where X is a positive random variable with a 
particular distribution. Similarly, we expect that for Q„ the largest cluster inside the window has 
size YV'^^^ for some positive random variable Y. Our current methods are not sufficient to prove 
this. 



2 Proof of Theorem 11.51 

In this section, we prove Theorem 11.51 bv showing that there is a Ci > such that when e~^'^^^^ < 
e<l, 

IC^axI > cie2" a.a.s. (2.1) 

and 

x(p)>(ci6)^2"[l + o(l)]. (2.2) 

The proof of ()2.1|) is based on the method of sprinkling and is given in Sections I2.1H2.2I The 
bound ()2.2|) is an elementary consequence and is given in Section IT^ 

2.1 The percolation probability and sprinkling 

For percolation on Z'^, the value of p for which x{p) becomes infinite is the same as the value 
of p where the percolation probability Pp(|C(0)| = oo) becomes positive jU I^HI- For Q„, there 
can be no infinite cluster, and the definition of the percolation probability must be modified. For 
p = Pc{n) + en~^ with e > 0, we defined the percolation probability in jllj by 

e^{p) = Pp(|C(0)| > iVj = P>^^, (2.3) 

where 

N. = N^ip) = ^{eV'/'y (2.4) 

and a is a fixed parameter in (0,1). The definition ()2.3|) is motivated as follows. According to 
Conjectures [UnHUZl above the window the largest cluster has size |Cmax| = 2eV^[l + o(l)] a.a.s., 
while the second largest has size IC2I = 2e~^logV^[l + o(l)]. According to this, above the window 
IC2I <^ Na ^ |Cmax|, SO that a cluster of size at least Na should in fact be maximal, and Oa{p) 
should correspond to the probability that the origin is in the maximal cluster. (The above reasoning 
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suggests the range < a < 3 rather than < a < 1, but the analysis of [TT] requires the latter 
restriction.) 

Let < a < 1. The combination of Theorem 1.6] with the verification of the triangle 
condition in implies that there are positive constants bg, bio such that 

bioe < e^ip) < 27e, (2.5) 

where the lower bound holds when bgV^^^^ < e < 1 and the upper bound holds when e > V~^^^. 
In addition, there are positive bu, bu such that if max{6i2V^~^'^^, V~'^} < e < 1, where r] = Ifrff ? 
then 

Pp(|C^ax| < [1 + {eV^r'WaipW^ > 1 - p7^^- (2.6) 

In the above statements, the constants 69, 610, bn and 612 depend on both a and A. Note that 
although ()2.6j) does not obtain the precise constant of p.4|l found by Bollobas, Kohayakawa and 
Luczak, it does extend the range of p from p > {n — + 60(log^ n)n~'^ to p > Pc{n) + 2~^' for 
any rj' < rj. Also, note that the combination of ()2.6|) and ()2.5p gives a refinement of ()1.28|) . 
Let 

Z>N^= E n\Cix)\>N^] (2.7) 

denote the number of vertices in "moderately" large components. Then Ep(Z>7Va) = 6a{p)V and 
hence, by (j2.5|l . above the window Kp{Z>N^) = Q{eV). In the proof of Theorem 1.6 ii)], it is 
shown that 

p,(|z>;v„ - ve^{p)\ > (eVT've^ip)) < (2-8) 

for percolation on an arbitrary finite connected transitive graph that obeys the triangle condition, 
and hence ()2.8|1 holds for Q„. For e > e~'^'^^^^ and fixed a G (0, 1), there are therefore positive 
constants ?7i, ?72, A such that 

Fp[\Z>N^ - Veo.{p)\ > V'-'''9„{p)) < AV-^\ (2.9) 

This shows that 2'>jv^ is typically close to its expected value VOaij))- For V sufficiently large and 
< e < 1, it follows from (j2.9j) and the lower bound of (j2.5|) that 

Fp(^>iv„ > 26ioeV) < AV-'^^ (2.10) 

The proof of 1)2.11) is based on the following sketch. Let p > Pc{n) + en' -1 with e--^"^ < e < 1. 
Let p~ = \{p — Pc{n)), and define p^ by p^ + p^ — p^p^ = p. Then a percolation configuration 
with bond density p can be regarded as the union of two independent percolation configurations 
having bond densities p~ and p"*". The additional bonds due to the latter are regarded as having 
been "sprinkled" onto the former. For percolation with bond density p~ , it follows from ()2.10|) that 
a positive fraction of the vertices lie in moderately large components. We then use the specific 
geometry of Q„, in a crucial way, to argue that after a small sprinkling of additional bonds a 
positive fraction of these vertices will be joined together into a single giant component, no matter 

1 /3 

how the vertices in the large components are arranged. Our restriction e > e~'^^ enters in this 
last step. 
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2.2 The largest cluster 

In Proposition 12.51 below, we prove the lower bound on jCmaxI of ()2.1|) . In preparation for Proposi- 
tion EIH we state four lemmas. The first lemma uses a very special geometric property of Q^. For 
its statement, given any X C Q„ and positive integer ci, we denote the ball around X of radius d 
by 

B[X, d] = {y eQn-^x e X such that p(x, y) < d}, (2.11) 
where p(x, y) denotes the graph distance between x and y. 

Lemma 2.1 (Isoperimetric Inequality). If X C and \X\ > J2i<u (") ^hen 

\B[X,d]\> Y: (2.12) 

i<u+d \^/ 

Lemma im is proved in Harper ^^l- Bollobas jH] is a very readable and more modern reference. 
The result of Lemma (2.11 mav be seen to be best possible by taking X = B[{v},d'] for any fixed 
V G Q„, so that B[X, d] = B[{v},d' + d]. For asymptotic calculations we use the inequality of the 
following lemma. 

Lemma 2.2 (Large Deviation). For A > 0, 

E f""! = E < 2"e-^'/2n_ (2.13) 



- 2 



^ / . n + A V * 



Proof. The first two terms are equal by the symmetry of Pascal's triangle. Dividing by 2", the 
inequality may be regarded as the large deviation inequality 

Pr[^„ > A] < e-^'/2n^ (2.14) 

where Sn = J2^=i^i with the Xj independent random variables with P(Xj = ±1) = |. A simple 
proof of this basic inequality is given in |6i Theorem A. 1.1]. □ 

Lemma 2.3 (Big Overlap). Let A,e > satisfy e'^'/^n ^ |^ 5,T C Q„ with \S\, \T\ > e2". 
Then 

\B[S,A]nT\>^\T\. (2.15) 

Proof From Lemma|2Il \S\ > Ei<(n-A)/2 (")• Hence, by LemmaO \B[S,A]\ > Ei<(n+A)/2 (") 
(intuitively, we have crossed the equator). Therefore, by Lemma f2.2[ 

\Qn\B[S,A]\< E^ Q < |2" < ^|T|, (2.16) 



and so B[S, A] must overlap at least half of T. □ 

Lemma 2.4 (Many Paths). Let A, e > satisfy e'^'/^n ^ |^ Let S,T CQ^ with \S\, \T\ > e2". 

1 

2 



Then there is a collection of le2^n vertex disjoint paths from S to T, each of length at most 



A. 
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Proof. Set Ti = B[S, A] fl T. By Lemma ESI l^^il > |e2". Let T2 C Ti be a maximal subset such 
that no X, y G T2 are within distance 2 A of each other. Every y eTi must he in a ball of radius 
2 A around some a; G T2 and each such ball has size at most n?^ (using a crude upper bound), so 
1^2! > '^"^^l^il- For each a; G T2 there is a path of length at most A to some z E S and the paths 
from X, ?/ G T2 must be disjoint as otherwise x^y would be at distance at most 2A. □ 

Now we use Lemma (2.41 and ()2.10|) to prove ()2.1|) . 

Proposition 2.5 (Sprinkling). There are absolute positive constants Ci,(3 such that 

P(|C^ax| < cie2") < 2-^". (2.17) 
whenever e~^'^^''^ < e<l. In particular, |Cmax| > Cie2'^ a.a.s. 
Proof. As usual, we write p = Pc{n) + en~^. Let p~ be such that 

2n 2n 

Note that p~ = Pc{n) + e/ {2n) + o{e/n). We consider the random subgraph of Q„ with probability 
p as the union of the random subgraph G~ with probability p~ and the random subgraph H (the 
sprinkling) with probability e/{2n). Crucially, G~ and H are chosen independently. 
Let Ci {i G /) denote the components of G~ of size at least 2""/3_ g^^ 

D = [jCi and M = \D\, (2.19) 

and note that 

A^^ = > V"/^ = 2""/^ (2.20) 

Since M > Z>n^ by ()2.2()j] . it follows from ()2.1()j) that there is an absolute positive constant C2 
such that 

Fp- (M > C2e2") > Pp- iZ>N^ > C2e2") ^ 1 (2.21) 

exponentially rapidly in n. Thus we may assume that G~ has M > C2e2". It suffices to show 
that the probability that at least |M vertices of D lie in a single component of G~ U H tends 
to 1 exponentially rapidly in n (intuitively, that the sprinkling H joins together the disparate 
components of G~). We will prove this by estimating the complementary probability, which we 
will show is in fact much smaller than exponential in n. 

Suppose that it is not the case that at least |M vertices of D lie in a single component of 
G^ U H (sprinkling fails). Then there exists J ^ I such that the union of Cj over j E J has 
between |M and |M vertices of D. Set K = I\J for convenience and let Cj, Ck denote the union 
of the Ci over i E J, i E K respectively. Then D = Cj U Ck, and each of Cj and Ck has size 
between and |M. Fix such J, K. Critically, there must be no path from Cj to Ck in H. 

Let A be such that e~^^/^" < |c2e, and set C3 = |c2. By Lemma 12.41 there are at least 
C3e2"?T,~^^ disjoint paths from Cj to Ck in Qn, each of length at most A. Each path is in H with 
probability at least (e/2n)^. Disjointness implies independence and the probability that H has 
none of these paths is at most 

1 _ {e/2n)^] ' < exp -e^C3e2"n-3^2-^ . (2.22) 
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The above quantity bounds the probabihty that sprinkhng fails for a particular J, K. Since 
each component Cj is of size at least 2""/^, the number of components |/| is at most 2"(^~"/^\ The 
number of choices for J (and hence K = I\J) is bounded by 2 to this number. Thus the total 
probability that sprinkling fails is bounded from above by 



2 exp 



-e^C3e2"n'=^^2-^ 



exp 



(log2)2"(^-°/3) _gA^^^2 



(2.23) 



e ^"^^^ . It is at this final 



Finally, we make some computations to estimate ()2.23|) . Take e 
stage of the argument that we use cannot do better than this specific form for e — increasing e 
only helps. Then we may take A ~ dr?!'^ . Fix C4 with 1 — a/3 < C4 < 1. We select c appropriately 
small so that > T^^-^^^ . Then e^c^eTrT^^I-^ > 2(^4+°(i))", since the factors csen~^^2-^ are 
absorbed into the o(l). Therefore, as required, ()2.23|) is exponentially small (in fact, doubly so). 
Thus we have shown that the probability that sprinkling fails for a particular J, K is much smaller 
than the reciprocal of the number 21^' of such J, K, and hence a.a.s. the sprinkling succeeds, no 
J, K exist, and there is a component of size at least \M. □ 



2.3 The expected cluster size 

Finally, we prove the lower bound on x{p) stated in ()2.2|1 . Let Ci,C2, . . . ,C2i denote the clusters 
in Q„ arranged in decreasing order: 



|Cl| — |Cjnax| ^ IC2I > • • • , 

with Cj = if there are fewer than i clusters. By translation invariance. 



X{p) = 2-" Ep|C(a;)| = 2"" E E l^^l ^ 



2" 



2""E. 



i=l 



> 2""E, 



|2 



IC, 



max 



By dnzi), 



X{p) > 2-"[cie2"]^Pp(|C^ax| > cie2") = [1 + o(l)](cie)^2-. 



This completes the proof of Theorem 11.51 



(2.24) 



(2.25) 



(2.26) 
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